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INTRODUCTION 


Metastasis  is  one  of  the  biggest  problems  in  cancer  therapy.  E-cadherin  (CDH1)  is 
known  to  be  involved  in  epithelial-mesenchymal  transition  (EMT)  process,  thus  playing 
an  important  role  during  the  tumor  metastatic  progression  [1-3].  It  is  reported  that  losing 
CDH1  expression  in  conjunction  with  EMT  might  contribute  to  the  development  of 
multidrug  resistance  (MDR)  to  chemotherapy  [4-6].  Thus,  understanding  the  CDH1 
down-regulation  mediated  EMT  and  its  relationship  with  MDR  will  have  great  benefit  for 
improving  chemotherapy  efficiency.  Most  of  the  present  in  vitro  researches  analyzing  the 
functions  of  CDH1  and  its  involvement  in  mediating  metastasis  have  been  carried  out  in 
2D  cultured  cell  lines.  However,  it  has  been  reported  that  cells  in  2D-culture  systems 
have  different  gene  expression  patterns  than  in  vivo  cells  [7-11].  By  comparison,  the  3D 
culture  systems  provide  more  accurate  molecular  pictures  of  human  disease.  The 
establishment  of  3D-culture  systems  will  be  a  better  way  to  mimic  the  tissue  samples  and 
provide  us  with  more  information  that  recapitulates  human  tissues. 

In  this  research,  we  aim  to  establish  a  3D  model  that  can  simulate  the  in  vivo 
cancer  models  and  can  be  induced  to  down  regulate  its  CDH1  expression  when 
stimulated,  and  thus  can  be  used  for  simulation  of  EMT  processes.  We  selected  CDH1  as 
a  target  for  manipulation  because  its  expression  is  known  to  cause  pronounced  chemical 
and  phenotypic  changes  in  epithelial  cells.  The  CDH1  knock  down  needs  to  be  inducible 
because  expression  of  CDH1  is  essential  while  forming  the  3D  structures  and  the  CDH1 
will  be  knocked  down  after  the  formation  of  3D  structures.  We  hypothesize  that  by 
reducing  the  expression  of  CDH1  in  colorectal  cancer  cell  lines  cultured  in  3D-structures, 
the  colorectal  cancer  cells  will  display  EMT  characteristics  and  gain  drug-resistance 
compared  to  CDH1  (+)  colorectal  cancer  cells.  We  hypothesize  that  CDH1 
reduction  will  induce  changes  in  downstream  pathways,  including  proteins  in  pathways 
of  cell  survival  (PI3K,  GSK3[3,  EGFR)  apoptosis  (caspases  family  proteins,  Bcl-2  family 
proteins),  and  cell  migration  (Rho  family  proteins  like  Cdc42,  Racl,  RhoA,  WASP 
family  proteins,  and  PAR/aPKC  complexes).  Our  research  will  be  the  first  to  connect 
EMT  to  drug  resistance  in  3D-culture  systems. 

During  the  first  year  of  this  research,  we  put  our  effort  mainly  into  establishing 
and  optimizing  the  inducible  CDH1  knock  down  model  with  3D  culturing  system,  and 
evaluating  its  characteristics  and  performance.  We  successfully  generated  two  inducible 
CDH1 -knock-down  cell  lines  (HT29-Green  and  HT29-Blue)  from  the  parental  colorectal 
cancer  cell  line  HT29,  which  can  express  shRNAs  targeting  to  CDH1  under  the  treatment 
of  doxycyclin.  These  two  cell  lines  together  with  one  negative  control  cell  line  (HT29- 
Vector)  which  contains  non-specific  targeting  shRNA  are  used  to  establish  a  model  for 
EMT  study  in  3D-culture  system.  The  optimized  conditions  for  generating  3D  structures 


were  determined  by  monitoring  the  morphologies  of  3D  structures  formed  with  12 
different  seeding  densities  on  1%  agarose  coated  96-well  plates.  The  diameters  of  3D 
structures  were  recorded  every  day  to  monitor  their  growth.  Cells  were  allowed  to  form 
3D  structures  without  doxycyclin  treatment,  and  after  3  days  when  the  3D  structure 
formed,  doxycyclin  was  added  to  induce  CDH1  knock-down.  The  expression  levels  of 
CDH1  in  3D  structures  formed  with  HT29- Vector,  HT29-Green  and  HT29-Blue  were 
compared  with  real  time  PCR  to  confirm  the  knock  down  of  CDH1.  The  morphology 
changes  of  CDH1 -knock-down  models  were  confirmed  with  scanning  electron 
microscopy  (SEM)  and  the  expression  of  CDH1  was  confirmed  with  confocal 
microscopy.  Boyden  chamber  assays  were  carried  out  to  confirm  the  increase  of 
metastatic  ability  in  CDH1 -knock-down  3D  structures.  The  successfully  established  3D 
EMT  model  will  be  used  for  further  downstream  pathway  analysis,  and  is  expected  to 
provide  more  details  on  how  CDH1  functions  during  metastatic  process. 


MAIN  APPROACHES  DURING  YEAR  ONE 

1.  Successfully  established  CDH1 -knock-down  (CDH1-KD)  HT29  cell  lines 
(9/1/2012-12/31/2012). 

We  have  successfully  established  CDH-1  knockdown  cells  lines  in  HT29s.  In  our 
proposal,  we  had  planned  to  use  lipid-based  approaches,  however,  found  that  lentiviral 
delivery  to  be  more  effective.  We  had  also  intended  to  generate  knockdown  lines  in  both 
HT29  and  HCT116;  however,  only  HT29  was  amenable  to  transfection. 

As  stated  in  our  proposal,  we  originally  expected  to  have  the  CDH1-KD  cell  lines 
(HT29  and  HCT116)  established  by  the  time  we  begin  the  project.  However,  unlike  the 
MCF7  cells  which  we  successfully  established  with  FuGENE  lipid-based  transfection 
strategy,  the  HT29  and  HCT116  cells  showed  very  low  transfection  efficiency.  After 
struggling  for  several  months  with  the  FuGENE  and  lipofectamine  2000  transfection  with 
no  success,  we  changed  our  strategy  and  start  over  with  the  cell  line  establishment  when 
the  whole  project  began.  Since  the  lipid-based  strategy  didn’t  work,  we  decided  to  use  the 
lentiviral  system  to  pack  the  vectors  and  infect  the  cell  lines  with  the  lentiviral  particles  to 
generate  the  inducible  CDH1-KD  cell  lines. 

As  stated  in  our  proposal,  5  types  of  plasmids  were  generated,  including  three 
CDH1-KD  vectors,  which  we  refer  to  as  Gold,  Blue,  and  Green,  which  contain  shRNA 
sequences  that  target  different  part  of  CDH1  (Table  1),  one  negative  control  vector 
described  as  Vector  which  contains  an  shRNA  sequences  with  no  specific  targets,  and 
one  positive  control  vector  that  targets  GAPDH.  For  the  quality  control  of  the  5  types  of 
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pTRIPZ  vectors,  they  were  treated  with  restriction  enzyme  Sail.  All  5  purified  vectors 
showed  1  intact  band  without  Sail  treatment  and  3  bands  with  Sail  treatment  (Figure  1A), 
indicating  that  the  quality  of  the  vectors  are  good.  To  further  confirm  the  precise  shRNA 
sequence,  all  5  vectors  were  sequenced  and  the  sequencing  results  were  compared  with 
the  theoretical  shRNA  sequences.  The  alignment  results  showed  that  all  the  shRNAs  that 
target  CDH1  were  100%  precise  (Figure  IB). 


Table  1. shRNA  sequences  for  CDH1  knock  down. 


Commercial  # 

Artificial  Name 

Hairpin  sequences 

Targets 

V2THS_14834 

Blue 

tgctgttgacagtgagcgacctgttggtgtctttattat™ 

ACAGA7G7A  AATAATAAAGACACCAACAGGGTGCCTACTGCCTCGGA 

726-744 

V2THS_14838 

Gold 

TGCTGTTGACAGTGAGCGCCGTCGTAATCACCACACTGAA7AG7-GAAGC 

CACAGA7G7ATTCAGTGTGGTGATTACGACGTTGCCTACTGCCTCGGA 

1295-1313 

V2THS_243170 

Green 

TGCTGTTGACAGTGAGCGACGAGAGAGTTTCCCTACGTATrAGrGAAGCC 

ACAGATGTA  ATACGTAGGGAAACTCTCTCGGTGCCTACTGCCTCGGA 

1128-1146 

Red:  shRNA  sense;  Blue:  shRNA  anti-sense;  Green:  mir-30  loop 

The  lentiviral  infection  procedure  is  shown  in  Figure  1C.  293FT  cells  were  first 
transfected  with  5  pieces  of  lentiviral  particles  together  with  different  types  of  pTRIPZ 
vectors.  The  vectors  are  packed  into  the  lentiviral  particles.  A  green  fluorescence 
expression  plasmid  was  also  co-transfected  into  293FT  cells  to  evaluating  the  transfection 
efficiency  based  on  the  green  fluorescence.  Because  it  doesn’t  contain  the  lentivial 
packing  sequence,  it  will  not  affect  the  packing  process  of  our  aim  vectors.  The 
transfection  efficiency  of  293FT  is  shown  in  Figure  ID,  evaluated  based  on  expression  of 
green  fluorescence.  The  data  in  this  figure  shows  that  the  lentiviral  particles  and  plasmids 
were  transfected  into  293FT  cells  with  very  high  efficiency. 
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Figure  1.  Confirmation  of  vector  purification  and  lenti-virus  packaging. 
(A)  Purified  vectors  were  digested  with  Sail  showing  3  bands.  (B)  Blast 
of  shRNA  sequences  incorporated  in  the  purified  vectors  against  the 
theoretical  sequences.  All  shRNA  sequences  showed  100%  alignment 
with  the  theoretical  sequences.  (C)  Brief  procedure  for  lenti-virus 
packaging.  (D)  Lenti-virus  packaging  in  293  FT  cells.  A  vector 
expressing  GFP  were  co-transfected  into  the  293FT  cells  to  indicate  the 
transfection  efficiency. 


After  successfully  generating  lentiviral  particles  containing  the  5  different  types 
of  pTRIPZ  vectors,  the  lentiviral  particles  were  purified  and  used  for  infecting  two 
colorectal  cancer  cell  lines  (HT29  and  HCT116).  For  each  cell  line  and  each  type  of 
pTRIPZ  vector,  we  infected  the  cells  with  two  different  batches  of  lentiviral  particles,  and 
for  the  first  batch  of  lentiviral  particles,  3  different  tilter  amounts  (1  ml,  0.5  ml,  and  0.25 
ml)  were  tested  in  parallel  (Table  2).  In  summary,  10  different  cell  lines  were  generated 
under  40  different  conditions  and  screened  with  puromycin  for  at  least  2  weeks  to 
establish  stable  pTRIPZ  containing  cells. 


Table  2.  Lentiviral  infection  conditions. 


HT29 

HCT116 

Virus-Batch  1 

Batch  2 

Virus-Batch  1 

Batch  2 

Tilterl 

Tilter2 

Tilter3 

Tilterl 

Tilterl 

Tilte  r2 

Tilte  r3 

Tilterl 

Vector 

1 

1 

1 

1 

1 

1 

1 

1 

GAPDH 

1 

1 

1 

1 

1 

1 

1 

1 

Green 

1 

1 

1 

1 

1 

1 

1 

1 

Blue 

1 

1 

1 

1 

1 

1 

1 

1 

Gold 

1 

1 

1 

1 

1 

1 

1 

1 

Total:  40  infection  conditions 


Doxycyclin  was  then  added  to  these  established  cell  lines  to  induce  the  expression 
of  TurboRFP  together  with  shRNAs.  Red  fluorescence  was  observed  with  each  cell  line. 
For  modified  HT29  cell  lines,  red  fluorescence  was  observed  in  most  of  the  infection 
conditions  (Figure  2).  In  addition  to  expressing  strong  fluorescence  intensity,  the  “Blue 
(Bl)”  and  “Green  (Gr)”  CDH-1  knockdown  cells  showed  altered,  rounded  morphology 
(Figure  2,  Rows  3  &  4).  On  the  contrary,  for  modified  HCT116  cell  lines,  weak  red 
fluorescence  was  only  observed  in  several  conditions  and  the  fluorescence  decreased  with 
each  passage  (data  not  shown). 

The  down-regulation  of  CDH1  was  confirmed  with  RT-PCR.  For  HT29  cell  lines, 
HT29-Green  and  HT29-Blue  showed  the  best  CDH1  knock-down,  while  GAPDH 
expression  was  unchanged  (Figure  3A).  While  for  HCT116  cell  lines,  HCT1 16-Green 
and  HCT1 16-Blue  showed  only  mild  down-regulation  of  CDH1,  and  HCT1 16-Gold 
showed  no  CDH1  knock-down  effects  (Figure  3B).  These  results  are  in  accordance  with 
the  TurboRFP  observation.  Especially  for  the  HT29-Green  and  HT29-Blue  cells,  after 
inducing  CDH1-KD  shRNA  expression,  cells  showed  an  altered,  rounded  morphology, 
compared  with  HT29-Vector,  HT29-GAPDH  and  HT29-Gold  cells  (Figure  2).  In 
addition,  different  virus  tilter  seemed  to  have  no  obvious  effect  on  CDH1  knock-down 


efficiency  (Figure  3A.  Green  and  Blue).  As  a  result  of  these  observations,  only  HT29- 
Green  and  HT29-Blue  with  virus- 1  were  chosen  for  later  experiments  and  HT29- Vector 
was  used  as  negative  control  for  CDH1-KD. 


Figure  2.  Morphology  of  HT29  cells  after  infection  with  five  types  of  lentiviral 
particles  and  screening  with  puromycin  for  14  days.  The  cells  that  survived  in 
puromycin  screening  were  treated  as  shRNA/TurboRFP  expressing  cells  and 
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induced  to  express  shRNA/TurboRFP  by  adding  lpg/ml  doxycyclin.  Red 
fluorescence  indicates  the  expression  of  Turbo  RFP,  which  also  indicates  the 
expression  of  shRNAs.  Morphology  changes  can  be  observed  in  CDH1-KD 
shRNA  expression  cells. 


Figure  3 
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Figure  3.  Confirmation  of  knock-down  of  CDH1  RNA  expression  in  2D  culturing 
system  with  RT-PCR.  (A)  HT29  cell  lines  were  analyzed  with  different  virus 
amounts  containing  different  shRNAs.  (B)  HCT116  cell  lines  were  analyzed  for 
expression  of  CDH1  and  GAPDH. 


2.  (Task  la)  Optimize  culture  conditions  for  forming  3D  structures.  (12/1/2013- 
2/28/2013) 


Non-transfected  HT29  and  HCT116  cells  were  first  used  for  optimization  of 
conditions  to  form  3D  structures.  For  each  cell  line,  12  different  seeding  densities  were 
tested,  as  shown  in  Figure  4A  and  Figure  4C.  After  seeding  onto  the  agarose  coated  96- 
well  plates,  cells  will  aggregate  at  the  round  bottom  of  the  wells  and  3D  cell  structures 
can  be  observed  from  day  2.  Because  on  Day  2  most  of  the  cells  haven’t  proliferated  yet 
and  the  proliferation  of  seeded  cells  can  be  neglected,  the  volume  of  the  3D  structures  can 
be  approximately  treated  as  similar  to  the  seeded  cell  numbers.  We  measured  the 
diameter  of  the  3D  structures  on  Day  2  and  both  HT29  and  HCT116  cell  lines  showed 
good  agreement  between  the  measured  diameters  and  the  cube  roots  of  correspondent 
seeding  densities  (Figure  4A  and  4C),  with  R2  of  0.996  for  HT29  cells  and  0.997  for 
HCT116  cells.  After  the  initiation  of  3D  structure  formation,  cells  within  the  3D 
structures  began  to  proliferate  and  the  3D  structure  increased  in  size.  To  monitor  their 
proliferation,  we  acquired  optical  images  of  the  3D  structures  every  24  hours  from  day  2 
to  day  24,  and  every  48  hours  from  day  24  to  day  36  for  HT29  cells,  and  to  day  34  for 
HCT116  cells.  With  each  seeding  density,  we  observed  10  different  wells  to  account  for 
biological  variation  and  minimize  recording  errors.  We  generated  3,480  pictures  for  the 
HT29  cell  cultures  and  3,360  pictures  for  HCT116  cell  cultures.  The  3D  structure 
diameters  were  measured  for  each  picture  and  the  results  were  analyzed  with  MATLAB. 
The  figures  of  diameters  of  3D  structures  verses  the  number  of  days  in  culture  were 
generated  as  shown  in  Figure  4B  and  4D.  The  numerical  diameters  and  their  standard 
deviations  are  reported  in  Table  3  (HT29  cells)  and  Table  4  (HCT116  cells).  As  shown  in 
Figure  4B,  for  HT29  cells,  when  seeding  densities  are  lower  than  15,000  cells/well,  cells 
grow  for  the  first  several  days  with  different  growth  rates.  However,  for  those  with 
seeding  densities  higher  than  15,000  cells/well,  the  formed  3D  structures  do  not  increase 
in  diameter  with  culturing  time.  All  the  3D  structures  with  different  seeding  densities 
become  similar  size  at  day  15.  Between  days  25  and  30,  the  size  of  the  cultures  remain 
constant  for  many  seeding  densities.  However,  after  transfer  to  a  new  agarose  plate,  they 
began  to  grow  again  (Figure  4B  arrow). 

As  shown  in  Figure  4D,  for  HCT116  cells,  the  3D  structures  showed  similar 
characteristics  as  HT29  cells,  and  all  3D  structures  with  different  seeding  densities 
showed  similar  size  at  day  13.  Then  the  3D  structures  stopped  growing  and  those  with 
higher  seeding  densities  began  to  collapse  (Figure  4D).  The  morphologies  of  the  3D 
structures  formed  with  different  seeding  densities  are  shown  in  Figure  4E  for  HT29  cells 
and  Figure  4F  for  HCT116  cells.  For  the  higher  seeding  densities,  the  3D  structures 
appeared  to  be  less  viable  as  early  as  day  10.  To  confirm  that  the  HT29-Green,  HT29- 


Blue  and  HT29- Vector  cells  have  similar  proliferation  properties  as  the  non-transfected 
HT29  cells,  the  same  density  of  these  cells  were  seeded  and  monitored  every  day.  Their 
morphologies  at  day  7  were  shown  in  Figure  5.  They  did  not  show  much  difference 
during  the  proliferation  process.  To  keep  the  cells  viable  and  proliferating  during 
experiments,  we  chose  the  seeding  density  of  3000  cells/well  and  kept  the  overall 
culturing  time  within  15  days.  The  one  exception  is  for  the  SEM  observation,  when  for 
manipulation  convenience,  we  used  the  seeding  density  of  7000-9000  cells/well. 
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Fig  4.  Optimization  of  the  conditions  to  form  3D  structures.  (A)  Relationship  of 
seeding  densities  and  diameters  of  3D  structures  of  HT29  cells.  (B)  Growing  curve 
of  HT29  cells  seeded  with  different  seeding  densities.  The  growth  of  3D  structures 
is  evaluated  by  their  diameters.  (C)  Relationship  of  seeding  densities  and  diameters 
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of  formed  3D  structures  of  HCT116  cells.  (D)  Growing  curve  of  HCT116  cells 
seeded  with  different  seeding  densities.  The  growth  of  3D  structures  is  evaluated 
by  their  diameters.  (E,  F)  Pictures  of  3D  structures  on  different  culturing  days  with 
different  seeding  densities.  (E)  HT29  cells,  and  (F)  HCT1 16  cells. 


Figure  5 
A 


Fig  5.  Morphology  of  3D  structures  formed  by  HT29-Vector,  HT29-Green,  HT29- 
Blue  and  non-transfected  HT29  cells  at  seeding  density  of  8000  cells/well  on  day  7. 


3.  (Task  lb)  Optimize  conditions  for  inducing  CDH1  knock-down  in  3D  system. 

(3/1/2013-5/31/2013) 

In  Figure  3,  we  showed  the  CDH1  knock-down  in  2D  culture  system.  Since  the 
3D  culture  system  is  different  from  the  2D  culture  system,  we  further  optimized  the 
conditions  for  inducing  CDH1  knock-down  in  the  3D  culture  system.  The  amounts  of 
doxycyclin  for  inducing  CDH1-KD  shRNA  expression  were  first  optimized.  Six  different 
doxycyclin  concentrations  were  evaluated  (with  final  concentrations  of  1  pg/ml,  2  pg/ml, 
4  pg/ml,  8  pg/ml,  and  12  pg/ml).  The  3D  structures  without  adding  doxycyclin  (0  pg/ml) 
were  used  as  a  control  group.  After  4  days  of  treatment  with  doxycyclin,  the  3D 
structures  in  groups  with  high  doxycyclin  concentrations  (8  pg/ml  and  12  pg/ml)  began 
to  disintegrate  so  that  they  were  discarded  without  an  evaluation  by  RT-PCR  (data  not 
shown).  The  toxicity  of  doxycycline  at  high  concentrations  might  be  the  reason  for  the 
disintegration.  As  a  result,  only  3D  structures  treated  with  doxycyclin  concentrations  less 
than  or  equal  to  4  pg/ml  were  further  analyzed  for  CDH1  expression  with  RT-PCR.  The 
CDH1  expression  in  HT29-Green  and  HT29-Blue  were  normalized  to  that  in  HT29- 
Vector  treated  with  the  correspondent  doxycyclin  concentrations  (Figure  6A).  The  results 
indicated  that  treatment  with  1  pg/ml  of  doxycyclin  is  sufficient  to  induce  CDH1  down- 
regulation  at  RNA  level,  and  further  increasing  the  doxycyclin  concentration  did  not 
show  much  improvement  on  CDH1  knock-down  efficiency  (Figure  6A). 

For  the  next  step,  the  doxycyclin  treatment  time  was  also  optimized.  3D  structures 
treated  with  2  pg/ml  doxycyclin  for  0  day,  1  day,  2  days,  3  days,  and  4  days  were 


collected  separately  and  checked  for  their  CDH1  mRNA  expression  (Figure  6B).  The 
CDH1  expression  in  HT29-Green  and  HT29-Blue  were  normalized  to  that  in  HT29- 
Vector  treated  with  doxycyclin  for  the  correspondent  times.  The  results  indicated  that  by 
2  days  of  treatment  with  2  pg/ml  of  doxycyclin,  the  CDH1  expressions  are  down- 
regulated  for  more  than  50%  at  RNA  level,  and  the  decrease  of  CDH1  expression  did  not 
show  many  changes  by  further  increasing  the  doxycyclin  treatment  time  (Figure  6B). 

With  the  above  optimization,  considering  the  possible  delay  from  RNA 
expression  to  protein  expression,  2  pg/ml  of  doxycyclin  and  treatment  for  3  days  were 
selected  for  later  experiments. 
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Figure  6.  Optimization  of  doxycyclin  treatment  conditions  for  inducing  CDH1 
down  regulation  in  3D  culturing  system.  (A)  RT-PCR  results  for  optimization  of 
doxycyclin  treatment  dose.  (B)  RT-PCR  results  for  optimization  of  doxycyclin 
treatment  time. 


4.  (Task  lc)  Confirm  morphological  cell  response  to  reduced  CDH1  expression. 
(6/1/2013-8/31/2013) 


Morphological  changes  induced  by  CDH1  down-regulation  was  first  confirmed 
with  scan  electron  microscope  (SEM).  HT29-Vector,  HT29-Blue  and  HT29-Green  cells 
were  seeded  at  the  density  of  8000  cells/well  and  cultured  for  10  days  followed  with  3 
days  treatment  with  2  pg/ml  doxycyclin.  This  experimental  condition  was  selected 
because  at  day  13  with  8000  cells/well  seeding  density,  the  3D  structures  are  big  and 
sturdy  enough  to  be  manipulated  for  SEM  sample  preparation.  Without  doxycyclin 
treatment,  HT29-Blue  and  HT29-Green  3D  structures  showed  no  morphological 
differences  compared  with  HT29-Vector  3D  structures  (Figure  7A,  B,  E,  F,  I  and  J).  Cells 
on  the  surface  of  the  3D  structures  were  smoothly  integrated  with  each  other  and 
individual  cells  are  not  clearly  defined.  However,  with  doxycyclin  treatment  for  3  days, 
HT29-Blue  and  HT29-Green  3D  structures  showed  very  clear  morphological  changes  to 
the  surface  cells  (Figure  7G,  H,  L  and  K),  having  less  integration  with  each  other  and 
clearer  individual  cell  structures.  In  contrast,  the  HT29-Vector  3D  structures  are  not 
visibly  altered  with  doxycyclin  treatment  (Figure  7C  and  D).  These  results  suggest  that 
CDH1  knock-down  with  shRNA  induced  by  doxycyclin  treatment  will  cause  reduced 
integration  of  the  surface  cells,  potentially  leading  to  migration. 


Dox  (+)  2500x  Dox  (+)  500x  Dox  (-)  2500x  Dox  (-)  500x 


Figure  7 


Figure  7.  Surface  morphology  of  HT29-Vector,  HT29-Blue  and  HT29-Green  3D 
structures  were  confirmed  with  SEM.  The  intact  spheroid  structures  are  shown  in 
Rows  1  &  3,  while  the  surface  of  the  spheroids  are  shown  in  Rows  2  &4.  Note  the 
changes  in  the  surface  cells  that  occur  for  the  Blue  and  Green  cell  lines  with  the 
additional  of  Doxycycline  (Row  4,  H  and  J). 


A  Boyden  chamber  assay  was  then  performed  to  quantitatively  evaluate  the 
migratory  of  different  3D  structures  with  or  without  doxycyclin  treatment.  The  number  of 
cells  migrated  from  the  upper  chamber  to  the  lower  chamber  were  evaluated  by  the 
fluorescence  intensity.  All  data  were  normalized  to  HT29-Vector  3D  structures  with 
corresponding  doxycyclin  treatment  conditions.  The  results  showed  that  with  doxycyclin 
treatment,  more  migrating  cells  were  detected  in  HT29-Blue  and  HT29-Green  3D 
structures  (Figure  8A).  However,  because  the  3D  structures  easily  stick  on  the  upper 
chamber  surface  and  are  difficult  to  remove,  only  one  of  the  three  replicates  attempted 
succeeded.  We  are  currently  working  to  replicate  this  experiment  as  well  as  searching  for 
other  methods  that  are  easier  to  perform. 

Immuno staining  of  CDH1  and  observation  with  confocal  microscope  was 
originally  proposed  to  confirm  the  expression  of  CDH1  with  or  without  doxycyclin 
treatment,  and  cells  at  inner  part  of  3D  structures  were  expected  to  be  observable  via 
confocal  microscopy.  However,  when  we  tried  the  experiment  with  3D  cultures  of  HT29- 
Green,  as  shown  in  Figure  8B,  with  increase  of  frame  numbers  (indicating  the  distance 
from  the  bottom  of  3D  structures,  with  the  lowest  surface  which  contact  with  bottom 
cover  glass  referred  to  as  Frame  1),  the  staining  signal  of  the  inner  part  became  weaker 
and  weaker.  We  consider  the  possible  reasons  might  be  as  follows:  1)  the  staining  process 
is  not  long  enough  to  allow  the  antibodies  and/or  DAPI  to  penetrate  into  the  inner  part  of 
the  3D  structures  (we  can  only  observe  up  to  2-3  layers  of  cells  from  the  surface);  and  2) 
the  setting  of  the  confocal  laser  is  not  powerful  enough  to  give  us  information  of  the  inner 
part  of  the  3D  structures.  The  Notre  Dame  imaging  facility  is  recently  adjusting  and 
optimizing  a  multiphoton  confocal  instrument.  Once  it  is  available  for  our  use  (within  the 
next  2-3  weeks),  we  will  repeat  these  experiments  with  the  multiphoton  settings.  With  the 
improved  instrument  settings,  we  hope  to  obtain  images  throughout  the  3D  structures. 


Figure  8 
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Figure  8.  Confirmation  of  cell  migration.  (A)  Boyden  chamber  assay  of  3D 
structures  with  or  without  doxycyclin  treatment.  (B)  Confocal  images  of  3D 
structures  immunostained  for  CDH1  (green  fluorescence)  and  nuclear  (DAPI,  blue 
color). 


5.  (Task  2a)  Establish  the  SILAC  method.  (7/1/2013-8/30/2013) 


During  year  2,  we  will  perform  quantitative  experiments  to  study  the  molecular 
changes  of  3D  structures  in  response  to  CDH1-KD  induced  drug  resistance.  We  will  use 
the  SILAC  label  strategy  to  label  3D  structures  under  different  conditions,  and  then  mix 


the  samples  together  and  perform  LC-MS/MS  analysis.  To  prepare  for  the  quantitative 
analysis,  we  first  analyzed  the  labeling  efficiency  of  the  SILAC  method.  SILAC  labels 
were  checked  both  in  2D  and  3D  culturing  systems.  The  results  show  that  in  both  2D  and 
3D  culturing  systems,  for  the  cells  cultured  in  media  supplemented  with  heavy  amino 
acids,  around  98%  of  peptides  were  labeled  with  heavy  amino  acids.  For  the  cells 
cultured  in  media  supplemented  with  light  amino  acids,  more  than  99.8%  of  the  peptides 
have  light  amino  acids,  and  the  presence  of  heavy  amino  acids  is  less  than  0.2%  (Figure 
8A).  The  protein  amounts  in  each  sample  were  determined  by  BCA  assay.  However, 
because  the  BCA  assay  can  sometimes  generate  some  error,  to  ensure  that  in  the  real 
experiments  the  light  and  heavy  samples  can  be  mixed  with  exactly  1:1  ratio,  a  pre¬ 
experiment  was  performed  to  determine  the  actual  heavy-to-light  mixing  ratio  (vol/vol). 
The  heavy-labeled  3D  structures  and  light-labeled  2D  layers  were  mixed  with  different 
volume  ratios  to  make  1:1  total  protein  ratio  determined  by  BCA  assay  (Figure  9B).  Six 
samples  were  generated  with  different  vol/vol  ratios  and  tested  with  LC-MS/MS  with  60 
min  run  (blue  line  in  Figure  9B)  and  120  min  run  (red  line  in  Figure  9B).  The  heavy-to- 
light  ratio  determined  by  LC-MS/MS  showed  linear  correlation  to  the  vol/vol  ratios 
determined  by  BCA  assay.  The  light-labeled  3D  structures  and  heavy-labeled  2D  labeled 
were  tested  with  the  same  strategy  and  showed  similar  results  (Figure  9C).  From  these 
linear  correlations  we  determined  the  actual  vol/vol  ratio  in  order  to  assess  the  peptide 
heavy-to-light  ratio  to  1:1  for  both  60  min  and  120  min  runs.  We  will  use  the  average  of 
these  two  numbers  to  make  the  heavy  and  light  sample  mixtures. 

Beside  this  optimization,  during  this  year,  we  also  developed  a  new  method  for 
phosphoproteomic  studies.  Compared  with  the  traditional  and  widely  used  methods  in 
phosphoproteome  field,  our  optimized  method  significantly  decreases  the  required 
sample  amount  from  15mg  to  3mg,  without  losing  detection  sensitivity.  This  approach  is 
published  in  Journal  of  Proteome  Research.  With  this  approach,  it  will  be  possible  to 
include  the  studies  on  phosphoproteomic  changes  caused  by  CDH1  knock  down  as  well, 
which  will  further  enrich  our  second  year  of  research. 


Figure  9 
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Figure  9.  SILAC  labeling  efficiency.  (A)  SILAC  heavy  and  light  amino  acid  label 
efficiencies  are  shown  in  bar  graph.  (B,  C)  Validation  of  the  ratio  (vol/vol)  to  mix 
heavy  and  light  samples  to  make  1:1  SILAC  mixtures.  (B)  is  for  the  mixture  of 
heavy-labeled  3D  structures  and  light-labeled  2D  layers,  and  (C)  is  for  the  mixture 
of  light-labeled  3D  structures  and  heavy -labeled  2D  layers. 


6.  (Task  Id)  Cellular  responses  with/without  reducing  CDH1  expression.  (8/1/2013- 
9/30/2013) 


Because  of  the  delay  of  the  CDH1-KD  cell  line  establishment,  we  just  began  the 
experiments  for  this  section  of  the  project. 


KEY  RESEARCH  ACCOMPLISHMENTS 

1.  The  colorectal  cancer  cell  lines  which  can  inducibly  knock  down  CDH1  expression 
were  successfully  established. 

2.  Successfully  established  the  3D  model  for  researching  CDH1  down  regulation  related 
biological  processes. 

3.  Successfully  optimized  the  conditions  for  inducing  CDH1  down  regulation  in  3D 
systems. 

4.  Successfully  observed  the  morphological  changes  of  the  3D  structures  when  inducing 
CDH1  knock  down. 

5.  Successfully  optimized  SILAC  conditions. 

6.  Established  a  new  method  for  phosphoproteomic  studies. 


REPORTABLE  OUTCOMES 
Manuscripts: 

Yue  XS,  Hummon  AB.  Combination  of  Multistep  IMAC  Enrichment  with  High -pH 
Reverse  Phase  Separation  for  In-Depth  Phosphoproteomic  Profiling.  J  Proteome  Res. 
2013  Sep  6;12(9):4176-86. 

Presentations: 

Yue  XS,  Hummon  AB.  Reducing  Sample  Size  and  Improving  Numbers  of  Identified 
Proteins:  Phosphoproteomic  Studies  with  SCX-IMAC  and  IMAC-RP  Methods 
(American  Society  for  Mass  Spectrometry  conference;  Minneapolis,  MN;  June,  2013) 


Yue  XS,  Hummon  AB.  3D  Model  for  Studying  Roles  of  CDH1  in  EMT  and  Metastasis 
(Walther  Cancer  Foundation  Symposium;  Notre  Dame  IN,  October  4-5,  2013) 

Yue  XS,  Hummon  AB.  IMAC-RP  method  for  Reducing  Sample  Size  and  Improving 
Numbers  of  Identified  Proteins  in  Phosphoproteomic  Studies  (Midwestern 
Universities  Analytical  Chemistry  Conference;  Notre  Dame  IN,  October  10-12,  2013). 

Yue  XS,  Hummon  AB.  3D  Model  for  Studying  Roles  of  CDH1  in  EMT  and  Metastasis 
(Midwestern  Universities  Analytical  Chemistry  Conference;  Notre  Dame  IN,  October 
10-12,  2013). 


Development  of  cell  lines: 

Developed  the  cell  line  set:  HT29-Vector,  HT29-Green,  HT29-Blue  for  evaluation  of 
CDH1  knock  down  effects. 


CONCLUSION 

In  the  first  year  of  research,  we  successfully  established  the  inducible  CDH1 
knock-down  cell  lines  with  HT29  cells  (HT29-Green,  HT29-Blue,  and  HT29-Vector  as 
negative  control),  and  used  these  cell  lines  to  establish  the  3D  models  for  investigating 
CDH1  knock-down  induced  EMT.  We  optimized  the  conditions  for  initiating  3D 
structure  formation,  and  found  the  best  conditions  for  inducing  CDH1  knock-down  in  3D 
structures.  The  optimal  conditions  are  3000  cells/well  seeding  density,  culturing  for  less 
than  10  days,  inducing  with  2  pg/ml  of  doxycyclin  for  3  days.  The  CDH1  down 
regulation  was  confirmed  with  RT-PCR  analysis,  and  the  cell  morphological  changes 
were  confirmed  with  SEM  and  confocal  microscopy  observation.  The  increase  of  cell 
migration  when  down  regulating  CDH1  was  confirmed  with  Boyden  Chamber  assay.  We 
also  optimized  the  SILAC  methods  and  developed  a  new  method  for  phosphoproteomic 
studies,  which  was  published  in  Journal  of  Proteome  Research. 

In  conclusion,  during  the  first  year,  we  accomplished  most  of  the  work  we 
proposed  in  our  original  proposal,  and  finished  the  first  part  of  the  second  year’s  tasks. 
The  first  year’s  research  provided  us  a  good  start  and  foundation  for  our  later  work. 
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Table  4.  Diameters  and  standard  deviations  of  HCT116  3D  structures  after  culturing  for  different  days. 
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ABSTRACT:  Typical  mass  spectrometric  phosphoproteome 
studies  are  complicated  by  the  need  for  large  amounts  of  starting 
material  and  extensive  sample  preparation  to  ensure  sufficient 
phosphopeptide  identifications.  In  this  paper,  we  present  a  novel 
strategy  to  perform  optimized  multistep  IMAC  enrichment  from 
whole  cell  lysates  followed  by  high-pH  reverse  phase  fractionation 
(multi-IMAC-HLB;  HLB  means  hydrophilic— lipophilic-balanced 
reversed-phase  cartridge).  The  peptide-to-IMAC  ratio  was 
optimized  to  maximize  IMAC  performance,  while  multistep 
IMAC  enrichment  enabled  improved  phosphopeptide  acquisition. 

The  addition  of  the  HLB  step  further  fractionates  the  IMAC 
enriched  phosphopeptides  while  desalting  the  samples,  which 
dramatically  reduces  the  sample  manipulation  time  and  sample 
loss  compared  to  other  popular  strategies.  We  compared  the  phosphopeptide  identification  results  of  the  multi-IMAC-HLB 
method  with  3  mg  of  starting  material  to  the  well-established  SCX-IMAC  method  with  15  mg  of  starting  material.  We  identified 
8969  unique  phosphopeptides  with  the  multi-IMAC-HLB  method,  compared  to  5519  unique  phosphopeptides  identified  with 
the  SCX-IMAC  method,  an  increase  of  62.5%.  The  increase  in  the  numbers  of  identified  phosphopeptides  is  due  to  the  increase 
in  the  ratio  of  identified  phosphopeptides  out  of  all  detected  peptides,  70.5%  with  multi-IMAC-HLB  method  compared  to  32.3% 
with  the  SCX-IMAC  method.  Multi-IMAC-HLB  is  a  robust  and  efficient  method  for  in-depth  phosphoproteomic  research. 

KEYWORDS:  phosphoproteomics,  multistep  IMAC  enrichment,  high-pH  reverse  phase  fractionation,  SCX  fractionation, 
high  pi  phosphopeptides 
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■  INTRODUCTION 

Proteins  are  widely  modified  with  posttranslational  modifica¬ 
tions  (PTMs)  such  as  phosphorylation,  methylation,  glyco- 
sylation,  and  ubiquitination.  Phosphorylation  is  a  reversible 
modification  and  is  commonly  involved  in  regulating  cellular 
processes  such  as  proliferation,  apoptosis,  differentiation, 
migration,  and  tumor  metastasis.1,2  Phosphorylation  and 
dephosphorylation  can  activate  or  deactivate  signaling  mole¬ 
cules,  thus  controlling  many  signaling  pathways.  Global 
phosphoproteomic  studies  can  provide  an  overall  image  of 
cellular  phosphorylation  status  at  certain  time  points  and  have  a 
high  potential  for  screening  biological  markers  and  targets  for 
disease  diagnosis  and  treatment.3,4  However,  because  of  the  low 
abundance  of  phosphoproteins  and  high  variability  of 
phosphorylation  sites,  in-depth  phosphoproteomic  studies 
commonly  suffer  from  two  major  problems:  (l)  the  need  to 
enrich  for  phosphoproteins/phosphopeptides  from  protein/ 
peptide  mixtures;  and  (2)  the  careful  fractionation  of  the 
protein/peptide  mixtures  to  reduce  the  sample  complexity  prior 
to  LC— MS  analysis.  Many  methods  for  phosphopeptide 
enrichment  and  fractionation  have  been  reported  including 
enrichment  with  immobilized  metal  affinity  chromatography 
(IMAC),5  polymer-based  metal  ion  affinity  capture  (Poly- 


MAC),6  titanium  dioxide  (Ti02),7  polymer-oxotitanium- 
modified  gold  wafer  (Au— P— oxoTi),8  and  fractionation  with 
strong  cation  exchange  (SCX),9  strong  anion  exchange 
(SAX),10  electrostatic  repulsion-hydrophilic  interaction  chro¬ 
matography  (ERLIC),11  and  hydrophilic  interaction  chroma¬ 
tography  (HILIC).12  The  combination  of  different  fractiona¬ 
tion  methods  with  diverse  enrichment  approaches  have  been 
utilized  in  many  phosphoproteomic  studies.13-16  In  most  of 
these  studies,  the  phosphopeptides  were  enriched  after  the 
fractionation  steps,  though  some  reports  suggest  that  the  order 
of  operations  does  not  affect  the  number  of  phosphopeptides 
identified.17 

The  combination  of  SCX  separation  followed  by  IMAC 
enrichment  is  a  widely  used  two-dimensional  approach  in 
phosphoproteomic  studies.18  ~1  This  method  typically  includes 
many  steps  including  tryptic  digestion,  an  initial  desalting  step, 
a  first  lyophilization,  SCX,  a  second  lyophilization,  a  second 
desalting  step,  a  third  lyophilization,  IMAC,  a  third  desalting 
step,  and  then  the  samples  are  analyzed  by  LC— MS  (Figure 
l).18  Many  studies  using  this  method  report  good  identification 
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Figure  1.  Workflow  of  SCX-IMAC  and  multi-IMAC-HLB  methods. 


of  large  numbers  of  phosphopeptides.19  21  However,  because 
there  are  so  many  steps  in  the  SCX-IMAC  method  and  each 
step  can  result  in  material  loss,  a  large  amount  of  starting 
material  is  required.  The  typical  starting  material  for  an  SCX- 
IMAC  procedure  is  15—20  mg  of  cell/tissue  digest,18  which  is 
difficult  to  achieve  if  working  with  precious  tissue  samples  and 
is  expensive  even  when  working  with  established  cell  lines.  The 
approach  is  even  more  expensive  when  combined  with  SILAC 
or  iTRAQ^  labeling  quantification  strategies.  Also,  it  is  hard  to 
control  the  amount  of  peptides  in  each  fraction,  which  results  in 
a  different  peptide-to-IMAC  ratio  in  each  fraction,  leading  to 
relatively  low  phosphopeptide  enrichment  efficiencies  in  some 
fractions. 

Online  high-pH  reverse  phase  (RP)  separation  is  another 
fractionation  method  used  prior  to  LC— MS  analysis  in 
proteomic  research.22-24  The  benefit  of  this  method  is  that 
samples  are  desalted  while  performing  the  fractionation,  which 
reduces  the  sample  manipulation  steps  and  decreases  the 
experimental  time  and  effort.  Currently,  to  the  best  of  our 
knowledge,  only  a  handful  of  researchers  have  reported  usin| 
the  high-pH  RP  separation  for  phosphoproteomic  research, 
and  no  research  has  been  published  comparing  the  high-pH  RP 
separation  method  with  the  popular  low-pH  separation 
methods  such  as  SCX  for  phosphoproteomic  research. 

In  this  paper,  we  optimize  the  IMAC  enrichment  conditions 
and  evaluate  the  phosphoproteome  of  the  human  mammary 
cell  line,  MCF-10A,  using  both  SCX-IMAC  and  our  optimized 
multistep  IMAC  enrichment  followed  by  the  high-pH  RP 
fractionation  method  (multi-IMAC-HLB).  We  identified  5519 
unique  phosphopeptides  in  13  fractions  using  the  SCX-IMAC 
method  from  15  mg  of  starting  material.  As  a  comparison,  using 
only  3  mg  of  starting  material,  8969  unique  phosphopeptides 
were  identified  in  12  multi-IMAC-HLB  fractions.  We  evaluate 
the  efficiency  of  the  SCX-IMAC  method,  the  multistep  IMAC 
enrichment  method,  and  the  multi-IMAC-HLB  method  by 
comparing  the  number  of  identified  unique  phosphopeptides, 
the  phosphopeptide  ratios,  the  numbers  of  singly  and  multiply 


phosphorylated  peptides,  the  numbers  of  phosphosites 
identified,  the  distribution  of  phospho-Serine  (p-Ser),  phos- 
pho-Threonine  (p-Thr),  and  phospho-Tyrosine  (p-Tyr)  sites, 
and  the  detected  charge  states  for  each  method.  We  also 
compare  the  pi  values  and  grand  average  of  hydropathy 
(GRAVY)  values  of  the  identified  phosphopeptides  with  the 
SCX-IMAC  or  the  multi-IMAC-HLB  method. 

■  MATERIALS  AND  METHODS 
Chemicals  and  Materials 

A  list  of  reagents  is  provided  in  the  Supporting  Information. 

Cell  Culture  and  Trypsin  Digestion 

MCF-10A  cells  were  cultured  in  DMEM/F12  (with  15  mM 
HEPES  buffer)  media  supplemented  with  5%  horse  serum,  0.01 
mg/mL  pf  insulin,  20  ng/mL  of  EGF,  100  ng/mL  of  cholera 
toxin,  and  500  ng/ mL  of  hydrocortisone.  After  growing  to  80% 
confluence,  cells  were  rinsed  twice  with  ice-cold  PBS  and  lysed 
with  8  M  urea  lysis  buffer,  which  contained  the  final 
concentration  of  8  M  urea,  75  mM  NaCl,  50  mM  Tris-HCl 
(pH8.2),  1  mM  NaF,  1  mM  /^-glycerophosphate,  1  mM  sodium 
orthovanadate,  10  mM  sodium  pyrophosphate,  1  mM  PMSF, 
and  1  tablet  of  EDTA-free  protease  inhibitors  cocktail  for  every 
10  mL  of  lysis  buffer.  Cell  lysates  were  sonicated  for  1  min 
followed  by  a  2  min  rest  interval,  and  this  operation  was 
repeated  for  3  times.  The  mixture  was  centrifuged  at  2500g  for 
10  min  to  remove  cell  debris.  Protein  concentrations  were 
confirmed  with  bicinchoninic  acid  (BCA)  protein  assay  kit 
(Thermo  Scientific  Pierce,  Rockford,  IL),  and  15  mg  of  protein 
was  used  in  the  SCX-IMAC  experiment,  while  3  mg  of  protein 
was  used  in  multistep  IMAC  enrichment  and  multi-IMAC-HLB 
experiments.  Proteins  were  treated  with  5  mM  DTT  for  25  min 
at  56  °C  to  reduce  their  disulfide  bonds.  Then  IAA  was  added 
to  the  protein  mixtures  to  make  a  final  concentration  of  14  mM 
and  reacted  in  the  dark  for  30  min  at  room  temperature  to 
allow  alkylation  of  the  cysteines.  Another  5  mM  DTT  was 
added  and  allowed  to  react  in  the  dark  for  15  min  to  stop  the 
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LAA  reaction.  The  protein  mixture  was  then  diluted  with  25 
mM  Tris-HCl  (pH  8.2)  to  achieve  a  final  urea  concentration  of 
1.8  M.  Proteins  were  digested  overnight  with  trypsin  from 
bovine  pancreas  (Sigma-Aldrich,  St  Louis,  MO)  at  37  °C  at  a 
50:1  protein  to  trypsin  ratio  (m/m)  in  the  presence  of  1  mM 
CaCl2.  The  digestion  reaction  was  stopped  by  adding  TFA  to  a 
final  concentration  of  0.4%  (v/v).  Samples  were  centrifuged  at 
2500g  for  10  min  to  remove  any  precipitates.  The  peptides 
were  then  desalted  by  500  mg  reverse-phase  tC18  Sep-Pak 
solid-phase  extraction  cartridges  (Waters  Corporation,  Milford, 
MA)  as  reported  previously. 

Strong  Cation  Exchange  (SCX) 

The  SCX  steps  were  performed  as  described  previously.18 
Briefly,  15  mg  of  desalted  peptides  were  resuspended  in  500  /<L 
of  SCX  Buffer  A  (30%  ACN,  7  mM  KH2P04,  pH  2.65)  and 
loaded  onto  PolySULFOETHYL  A  column  (9.4  X  200  mm,  5 
/.im  particle  size,  200  A  pore  size)  (PolyLC,  Columbia,  MD) 
with  5  injections.  The  separations  were  performed  on  a  Waters 
Alliance  e2695  System  (Waters  Corporation,  Milford,  MA)  at  a 
flow  rate  of  1  mL/min.  Liquid  chromatography  runs  (120  min 
total  time)  included  2  min  at  100%  Buffer  A,  33  min  of  a 
gradient  from  0  to  25%  Buffer  B  (30%  ACN,  7  mM  I<H2P04, 
350  mM  KC1,  pH  2.65),  1  min  of  a  jump  from  35  to  100% 
Buffer  B,  5  min  of  running  at  100%  Buffer  B,  6  min  of  washing 
with  100%  water,  11  min  of  washing  with  100%  Buffer  C  (50 
mM  K2HP04,  500  mM  NaCl,  pH  7.5),  6  min  of  washing  with 
100%  water,  and  52  min  of  equilibration  with  100%  Buffer  A 
The  chromatogram  was  recorded  at  220  and  280  nm.  The 
eluates  from  the  first  52  min  were  collected  at  4  min  intervals  to 
generate  13  fractions.  The  SCX  fractions  were  then  lyophilized 
to  reduce  the  volume  by  30%  and  desalted  with  100  mg 
reverse-phase  tC18  Sep-Pak  solid-phase  extraction  cartridges 
(Waters  Corporation,  Milford,  MA). 

IMAC  Enrichment 

For  the  SCX-IMAC  work  flow,  the  IMAC  enrichment  was 
performed  as  previously  reported.18  The  required  amounts  of 
Phos-Select  iron  affinity  gel  (IMAC  beads)  (Sigma-Aldrich,  St 
Louis,  MO)  were  washed  3  times  with  a  10X  volume  of  IMAC 
binding  buffer  (40%  ACN,  25  mM  FA  in  H20)  and 
resuspended  in  IMAC  binding  buffer  to  make  a  50%  gel  slurry. 
The  lyophilized  and  desalted  SCX  fractions  were  resuspended 
in  120  [aL  of  IMAC  binding  buffer  and  incubated  with  10  //L  of 
IMAC  slurry  at  room  temperature  for  60  min  with  vigorous 
shaking.  Incubated  IMAC  beads  were  then  washed  with  120  /aL 
of  IMAC  binding  buffer  3  times  to  remove  any  nonspecific 
binding.  Phosphopeptides  were  then  eluted  with  40  //L  elution 
buffer  (50  mM  K2HP04/NH40H,  pHIO)  3  times  by 
incubating  at  room  temperature  for  5  min  while  shaking. 
Formic  acid,  40  f.iL  of  a  10%  solution,  was  added  immediately 
after  elution  to  acidify  the  solution.  The  eluates  were  then 
lyophilized  and  desalted  with  C18  ZipTips  (Millipore,  Billerica, 
MA)  according  to  the  manufacturer’s  instructions.  For 
optimizing  IMAC  conditions,  different  amounts  of  IMAC 
slurries  were  added  to  3  mg  of  peptides  according  to  the 
indicated  peptide-to-IMAC  ratio  The  enrichment  was 

carried  out  in  the  same  way  as  described  above.  For  the 
multistep  IMAC  enrichment,  phosphopeptides  were  enriched 
from  3  mg  of  peptides  with  the  optimized  100:1  peptide-to- 
IMAC  ratio.  The  second  cycle  of  IMAC  enrichment  was 
performed  with  IMAC  flow-through  of  the  first  cycle  of  IMAC 
enrichment,  while  the  third  cycle  of  IMAC  enrichment  was 
performed  with  the  IMAC  flow-through  from  the  second  cycle 


of  IMAC  enrichment.  The  eluates  from  the  multistep  IMAC 
enrichment  were  either  desalted  with  C18  ZipTips  for  LC— 
MS/MS  analysis,  or  used  with  high-pH  reverse  phase  HLB 
desalting/ fractionation. 

High-pH  Reverse  Phase  HLB  Separation 

The  high  pH  reverse  phase  separation  was  carried  out  with 
OASIS  HLB  1  cc  Vac  Cartridge  with  30  mg  sorbent  per 
cartridge  and  30  fi m  of  particle  size  (Waters  Corporation, 
Milford,  MA).  All  buffers  used  in  HLB  separation  contained  10 
mM  NH4HC03  and  were  adjusted  to  pH  10  with  NH4OH. 
After  washing  with  3  mL  of  80%  ACN/NH4HC03  and 
equilibrating  with  3  mL  of  1%  ACN/NH4HC03,  IMAC 
enriched  samples  were  conditioned  with  0.5  mL  of  1%  ACN/ 
NH4HC03  and  loaded  onto  the  equilibrated  HLB  columns. 
The  samples  were  then  washed  with  3  mL  of  1%  ACN/ 
NH4HC03  to  remove  the  salts  introduced  during  the  IMAC 
enrichment.  Serial  elution  was  carried  out  stepwise  with  0.5  mL 
of  each  elution  buffer  in  the  order  of  5,  10,  15,  20,  25,  30,  35, 
and  80%  ACN/NH4HC03.  Eluates  were  passed  into  an 
Eppendorf  tube  containing  25  // L  of  100%  FA  to  acidify  and 
stabilize  the  phosphopeptides.  Forced  air  was  used  to  accelerate 
the  buffers  passing  through  the  cartridges.  To  reduce  the  LC— 
MS/MS  analysis  time,  fraction  numbers  were  reduced  by 
combining  the  5  and  the  25%  ACN/NH4HC03  fractions.  The 
10  and  the  30%  fractions,  the  15  and  the  35%  fractions,  and  the 
20  and  80%  fractions  were  also  combined.  The  resulting  HLB 
fractions  were  lyophilized  and  desalted  with  ZipTips  before 
LC— MS/MS  analysis.  ZipTips  were  not  used  if  working  with 
trap  column. 

Mass  Spectrometric  Analysis 

Before  liquid  chromatography  electrospray  ionization  tandem 
mass  spectrometry  (LC— ESI-MS/MS)  analysis,  all  samples 
were  lyophilized  and  resuspended  in  MS  loading  buffer  (l% 
HPLC  grade  ACN,  0.1%  FA  in  HPLC  grade  water).  The  mass 
spectrometric  analysis  was  performed  on  a  Q;Exactive  mass 
spectrometer  (Thermo  Fisher  Scientific,  Bremen,  Germany) 
coupled  with  a  nanoACQUITY  Ultra  Performance  LC  (UPLC) 
system  (Waters  Corporation,  Milford,  MA).  Peptides  were 
loaded  onto  a  C18  reverse  phase  column  (100  /im  X  100  mm, 
1.7  /(m  particle  size,  BEH130)  (Waters  Corporation,  Milford, 
MA)  with  97%  buffer  A  (0.1%  FA  in  water)  and  3%  buffer  B 
(0.1%  FA  in  ACN).  Peptide  separation  was  carried  out  with  a 
73-min  linear  gradient  from  3  to  30%  buffer  B,  then  maintained 
at  85%  buffer  B  for  5  min  and  re-equilibrated  with  3%  buffer  B 
for  10  min  before  the  next  run.  The  flow  rate  was  1200  nL/min. 
The  QTsxactive  was  equipped  with  a  nanoelectrospray  ion 
source  operating  at  a  source  voltage  of  1.8  kV  and  the  ion 
transfer  tube  temperature  of  280  °C.  The  full  MS  scans  were 
acquired  in  the  Orbitrap  mass  analyzer  with  an  m/z  range  of 
350—1800,  at  the  mass  resolution  of  70000  at  m/z=  200. 
Automatic  gain  control  (AGC)  target  value  was  set  to  1  X  106, 
with  a  maximum  fill  time  of  250  ms.  For  the  MS/MS  method, 
the  top  12  most  intense  parent  ions  were  selected  with  an 
isolation  window  of  2.0  m/z  and  fragmented  under  a 
normalized  collision  energy  (NCE)  of  30%,  with  the  AGC 
target  value  of  1  X  106  and  the  maximum  fill  time  of  120  ms. 
The  parent  ions  with  unassigned  charges  or  a  charge  state  of  z 
=  1  were  excluded  from  fragmentation.  The  intensity  threshold 
for  selection  was  set  to  8.3  X  104.  The  fragmentation  was 
performed  in  an  HCD  collision  cell  with  a  mass  resolution  of 
35000  at  m/z  =  200  and  a  dynamic  exclusion  period  of  20  s 
after  1  repeat  count.  All  samples  were  run  in  technical  duplicate. 
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Data  Analysis 

All  .raw  files  acquired  with  the  Q;Exactive  were  searched  with 
Proteome  Discoverer  1.3  software  (Thermo  Fisher  Scientific, 
Bremen,  Germany)  using  MASCOT  2.2.4  as  a  search  algorithm 
and  against  the  SwissProt  human  database  modified  by  adding 
common  contamination  sequences  (updated  on  05/2012,  86 
758  sequences).  The  peptide  false  discovery  rate  (FDR)  was 
determined  by  searching  against  the  corresponding  reverse 
database.  The  searches  were  carried  out  with  precursor  peptide 
mass  tolerance  of  10  ppm  and  fragment  ion  mass  tolerance  of 
0.02  Da,  and  allow  up  to  two  missed  cleavages  with  trypsin 
digestion.  Carbamidomethylation  of  cysteine  was  set  as  a  fixed 
modification,  while  oxidation  of  methionine  and  phosphor¬ 
ylation  of  serine,  threonine  and  tyrosine  were  set  as  variable 
modifications.  Search  results  were  screened  with  an  FDR  value 
of  0.01,  and  phosphopeptides  with  PhosphoRS  score  higher 
than  0.99  were  considered  to  have  confident  phosphosite 
localization.  Peptides  with  the  same  sequence  but  different 
oxidation  states  were  treated  as  the  same  peptide  considering 
that  the  oxidation  could  have  been  introduced  during  sample 
manipulation.  The  peptide  lists  were  exported  to  Microsoft 
Excel  and  processed  with  Excel  or  in-house  scripts.  The  4-circle 
Venn  diagrams  were  generated  by  the  online  tool  Venny 
(Oliveros,  J.C.  2007,  http://bioinfogp.cnb.csic.es/tools/venny/ 
index.html).  The  3-circle  proportional  Venn  diagrams  were 
generated  by  CircleApplet,  which  can  be  down  loaded  from 
https://www.cs.kent.ac.uk/people/staff/pjr/EulerVennCircles/ 
EulerVennApplet.html.  The  GRAVY  values  were  calculated  by 
the  GRAVY  calculator  (http://www.gravy-calculator.de/).  The 
pi  values  of  phosphopeptides  were  calculated  using  plCalcu- 
lator  (https://trac.nbic.nl/picalculator/).26  ExPASy  pIC,  values 
of  amino  acids  were  used  for  the  calculation,  and  pIC,  values  for 
the  phospho-  groups  were  set  as  pI<Cal  =  2.12  and  pICa2  =  7.21. 

■  RESULTS  AND  DISCUSSION 

Phosphopeptide  Ratio  Is  Dependent  on  Peptide-to-IMAC 
Ratio 

As  a  pre-experiment  and  a  comparison  standard,  we  used  the 
well-established  method  that  combines  SCX  and  IMAC 
enrichment  to  detect  phosphopeptides  in  the  MCF-10A  cell 
line.  Thirteen  SCX  fractions  were  collected  and  further 
enriched  with  IMAC  beads  as  previously  reported.18  (Support¬ 
ing  Information  Figure  Si)  With  this  method,  we  successfully 
detected  5519  unique  phosphopeptides  and  4989  unique 
phosphosites  with  confident  localization  (with  phosphoRS 
score  >0.99)  (Supporting  Information  Figure  S2  and  Table 
Si).  However,  the  main  drawback  of  the  SCX- IMAC  method  is 
the  need  for  a  large  amount  of  starting  material  to  ensure 
sufficient  numbers  of  identified  phosphopeptides;  typically 
more  than  15  mg  of  starting  materials  is  required.  Possible 
reasons  for  the  large  amount  of  starting  material  include  the 
following:  (l)  phosphopeptides  are  a  relatively  small 
percentage  of  the  total  number  of  peptides  and  therefore 
require  large  amounts  of  material  to  ensure  the  detection  of 
phosphopeptides  with  low  abundance,  and  (2)  there  are 
numerous  steps  in  the  SCX-IMAC  enrichment  method,  with 
each  step  resulting  in  sample  loss.  Another  problem  we 
observed  in  our  data  sets  is  the  uneven  distribution  of 
phosphopeptide  ratios  in  each  SCX  fraction  (Supporting 
Information  Figure  SIB  and  Table  Si).  The  extremely  low 
phosphopeptide  ratio  in  some  SCX  fractions  reduced  the 
overall  phosphopeptide  ratio  to  32.4%  with  SCX-IMAC 


method  (see  Figure  5B  and  Supporting  Information  Table 
S5).  We  considered  a  possible  reason  for  this  low  ratio  might  be 
that  the  peptide-to-IMAC  ratio  in  each  SCX  fraction  is  not 
optimized.  It  was  reported  previously  that  with  the  TiOz 
enrichment,  the  peptide-to-Ti02  ratio  is  crucial  for  the 
phosphopeptide  enrichment  efficiency.27  We  surmised  that 
peptide-to-IMAC  ratio  might  also  play  an  important  role  for 
IMAC  performance.  We  tested  10  different  peptide-to-IMAC 
ratios  as  shown  in  Figure  2.  The  results  show  that  with  the 


Figure  2.  Optimization  of  peptide-to-IMAC  ratio  for  IMAC 
enrichment.  Percentage  of  identified  phosphopeptides  under  different 
peptide-to-IMAC  ratios  (//g://L)  are  shown  in  the  bar  graph.  Identified 
total  peptide  numbers  under  each  condition  are  shown  with  the 
dashed  line. 


100:1  peptide-to-IMAC  ratio,  62.3%  of  the  total 

peptides  retained  were  phosphopeptides  (Figure  2,  Supporting 
Information  Table  S2).  At  higher  peptide-to-IMAC  ratios,  there 
might  not  be  sufficient  IMAC  beads  for  enrichment,  while  at 
lower  peptide-to-IMAC  ratios,  the  nonspecific  attachment  of 
nonphosphopeptides  to  IMAC  beads  might  occur.  As  an 
example,  with  a  3:1  peptide-to-IMAC  ratio,  6025  total  peptides 
were  identified  compared  with  3023  peptides  with  a  100:1 
peptide-to-IMAC  ratio.  However,  only  244  phosphopeptides 
(4.6%  of  total)  were  identified  with  a  3:1  peptide-to-IMAC 
ratio,  showing  a  dramatic  decrease  compared  with  1827 
phosphopeptides  (62.3%  of  total)  identified  with  a  100:1 
ratio  (Figure  2,  Supporting  Information  Table  S2). 

Considering  the  large  amount  of  required  starting  materials, 
complicated  manipulation  steps,  and  difficulties  defining  the 
total  amount  of  peptides  existing  in  each  SCX  fraction,  we 
decided  to  perform  enrichment  directly  from  the  whole  cell 
lysate.  An  advantage  of  this  strategy  is  that  we  knew  the  starting 
peptide  amount  and  thus  could  optimize  the  IMAC  enrich¬ 
ment.  In  addition,  we  anticipated  that  skipping  the  SCX 
fractionation  steps  and  working  with  the  optimized  multistep 
IMAC  enrichment  would  simplify  the  overall  sample 
preparation  process  and  reduce  sample  loss.  As  a  result,  fewer 
starting  materials  were  required,  and  experimental  time  was 
reduced. 

Multistep  IMAC  Enrichment  Directly  from  Whole  Cell 
Lysates 

To  decrease  the  amount  of  starting  materials  needed  for 
phosphopeptide  identification  and  to  increase  the  phosphopep¬ 
tide  enrichment  efficiency,  we  performed  direct  enrichment 
with  IMAC  beads  from  whole  cell  lysate.  As  starting  material,  3 
mg  of  MCF-10A  cell  lysate  were  used  for  the  IMAC 
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Figure  3.  Distribution  of  phosphopeptides  identified  by  multistep  IMAC  enrichment.  (A)  Numbers  of  identified  phosphopeptides  in  the  first, 
second,  and  third  round  of  IMAC  enrichment  with  phospho-serine  (blue),  phospho-threonine  (green),  or  phospho-tyrosine  (red)  sites,  and  those 
without  confident  localization  (phosphoRS  score  <99%,  dark  purple)  are  shown  in  the  bar  graph.  The  percentage  of  identified  phosphopeptides  from 
the  total  identified  peptides  in  each  round  of  IMAC  enrichment  are  shown  with  the  dashed  line.  (B)  Numbers  of  identified  phosphopeptides 
(phosphoRS  score  >99%)  with  1,  2,  3,  or  4  phosphate  groups  are  shown  in  the  bar  graph  with  purple,  blue,  yellow  and  red  colors.  Unique 
phosphosite  numbers  are  calculated  from  phosphopeptides  with  phosphoRS  score  >99%  and  are  shown  with  the  dashed  line.  (C)  Overlap  of  the 
nonredundant  phosphopeptides  identified  in  each  round  of  IMAC  enrichment  are  shown  in  the  Venn  diagram.  (D)  Charge  states  of 
phosphopeptides  identified  in  each  round  of  IMAC  enrichment  are  shown  in  the  pie  chart. 


enrichment.  According  to  the  optimized  peptide-to-IMAC 
ratio,  we  added  30  /.iL  of  the  IMAC  bead  slurry  to  the  peptide 
mixture  and  incubated  for  1  h  at  room  temperature.  The  results 
showed  that  with  the  first  round  of  IMAC  enrichment  (IMAC- 
l),  a  total  of  4661  peptides  were  identified  from  3  mg  of  cell 
lysate  within  a  180  min  LC— MS/MS  run  (with  a  150  min 
gradient),  and  among  these  peptides,  3699  were  phosphopep¬ 
tides,  with  a  phosphopeptide  enrichment  ratio  of  79.4% 
(Supporting  Information  Table  S3,  Figure  3A).  PhosphoRS 
was  used  to  evaluate  the  accuracy  of  phosphosites  local¬ 
ization.28  As  reported  previously,  phosphopeptides  with 
PhosphoRS  score  >99%  are  considered  to  have  confident 
phosphosite  localization.-8  We  only  counted  phosphopeptides 
with  confident  localization  for  mono-  or  multiphosphopeptide 
identification,  as  well  as  for  calculating  total  number  of 
phosphosites  identified  in  all  species.  IMAC-1  enrichment 
generated  2608  phosphopeptides  with  confident  localization 
information,  among  which  2166  were  monophosphopeptides, 
while  442  were  multiphosphopeptides  (Supporting  Information 
Table  S3,  Figure  3B). 

To  evaluate  how  complete  a  single  IMAC  enrichment  is  at 
extracting  phosphopeptides  from  the  whole  cell  lysate,  we  then 
carried  out  a  second  round  of  IMAC  enrichment  (IMAC-2) 
from  the  flow-through  of  IMAC-1.  We  repeated  the  process  for 


a  third  round  (IMAC-3),  with  IMAC  enrichment  from  the 
flow-through  of  IMAC-2.  In  IMAC-2  enrichment,  3079  total 
phosphopeptides  were  identified.  Of  these,  1791  phosphopep¬ 
tides  were  also  identified  with  IMAC-1,  while  1288 
phosphopeptides  were  unique  to  IMAC-2.  With  IMAC-3 
enrichment,  a  total  number  of  1531  phosphopeptides  were 
identified,  among  which  947  phosphopeptides  that  had  been 
identified  with  IMAC-1  enrichment  and  1252  phosphopeptides 
that  overlapped  with  IMAC-2  enrichment.  Only  an  additional 
223  unique  phosphopeptides  were  detected  with  IMAC-3 
enrichment  (Figure  3C),  indicating  that  three  cycles  of  IMAC 
enrichment  would  be  enough  to  enrich  most  of  the 
phosphopeptides  from  the  whole  cell  lysates. 

The  multistep  IMAC  enrichment  increased  the  number  of 
identified  phosphopeptides  from  3699  unique  phosphopeptides 
with  a  single  IMAC  enrichment  to  4727  unique  phosphopep¬ 
tides  (Supporting  Information  Table  S5),  an  increased  ratio  of 
27.8%.  In  addition,  in  each  enrichment  cycle,  the  distribution  of 
enriched  phosphopeptides  was  different.  Most  of  the  multiply 
phosphorylated  peptides,  a  total  of  442,  were  enriched  in  the 
first  round  of  IMAC  enrichment.  In  the  second  round  of  IMAC 
enrichment,  only  9  multiply  phosphorylated  phosphopeptides 
were  detected  each  containing  2  phosphate  groups,  while  in  the 
third  round  of  IMAC  enrichment,  all  identified  phosphopep- 
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Figure  4.  Distribution  of  phosphopeptides  identified  by  the  multi-IMAC-HLB  method.  (A)  Number  of  identified  phosphopeptides  in  each  HLB 
fraction  with  phospho-serine  (blue),  phospho-threonine  (green),  or  phospho-tyrosine  (red)  sites,  and  those  without  confident  localization 
(phosphoRS  score  <99%,  dark  purple)  are  shown  in  the  bar  graph.  Percentages  of  identified  phosphopeptides  from  the  total  number  of  identified 
peptides  in  each  HLB  fraction  are  shown  with  the  dashed  line.  (B)  Numbers  of  identified  phosphopeptides  (phosphoRS  score  >99%)  in  each  HLB 
fraction  with  1,  2,  3,  or  4  phosphate  groups  are  shown  in  purple,  blue,  yellow  and  red  colors.  Unique  phosphosite  numbers  in  each  HLB  fraction  are 
calculated  from  phosphopeptides  with  phosphoRS  score  >99%  and  are  shown  with  the  dashed  line.  (C,  D,  E)  Overlap  of  the  nonredundant 
phosphopeptides  identified  in  each  HLB  fraction  of  the  first  (C),  second  (D),  and  third  (E)  round  of  IMAC  enrichment  are  shown  in  the  Venn 
diagram. 


tides  were  singly  phosphorylated  (Figure  3B,  Supporting 
Information  Table  S3).  However,  the  distribution  of  p-Ser,  p- 
Thr,  and  p-Tyr  phosphosites,  and  the  distribution  of  charge 
states  of  identified  phosphopeptides  did  not  vary  among  the 
three  enrichment  cycles  (Figure  3A,D,  Supporting  Information 
Table  S3). 

Further  Fractionation  of  Multistep  IMAC  Enrichment 
Fractions  with  High-pH  HLB  (Multi-IMAC-HLB)  Increases 
Phosphopeptide  Identification 

Since  the  multistep  IMAC  enrichment  method  generates  only  3 
fractions,  it  might  not  be  a  suitable  method  for  in-depth 
phosphopeptide  identification.  To  maximize  the  number  of 
phosphopeptides  identified,  we  added  another  fractionation 
method  after  the  IMAC  enrichment  steps  to  further  simplify 
the  samples.  In  addition,  the  IMAC  process  introduces  salt  to 
the  samples  so  a  desalting  step  is  required  before  LC— MS/MS 
analysis.  In  order  to  reduce  the  number  of  manipulation  steps 
and  to  decrease  sample  loss,  we  added  a  second  dimension 
separation  while  desalting.  We  chose  high-pH  reverse  phase 
separation  as  our  second  dimension  method.  Because  the 


normal  RP  packing  materials  cannot  sustain  a  pH  as  high  as  10, 
we  used  HLB  cartridges  that  work  similarly  to  reverse  phase 
cartridges  but  are  more  suitable  and  efficient  for  peptide 
purification,  and  are  stable  under  high  pH.  Phosphopeptides 
from  each  cycle  of  the  multistep  IMAC  enrichment  were 
applied  to  the  HLB  cartridges  separately.  Salt  was  removed  by 
washing,  and  the  peptides  were  eluted  from  the  cartridge 
stepwise  with  increasing  acetonitrile  concentrations  under  high 
pH  conditions.  In  order  to  decrease  the  instability  of  the 
phosphopeptides,  the  collection  tubes  were  filled  with  25  f- iL  of 
formic  acid  in  advance,  so  that  the  eluted  phosphopeptides 
would  be  protonated  immediately  after  elution.  It  is  reported 
that  high  pH  LC  chromatograms  have  complementary 
characteristics  to  low  pH  chromatography.25  While  typically 
the  LC  solvents  are  acidic,  the  high  pH  for  the  second 
dimension  separation  is  expected  to  provide  good  orthogonality 
for  the  LC— MS/MS  analysis.  In  order  to  decrease  LC— MS/MS 
run  time,  we  mixed  the  HLB  fractions  eluted  with  5  and  25% 
ACN,  10  and  30%  ACN,  15  and  35%  ACN,  and  20  and  80% 
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Table  1.  Summary  of  Phosphopeptides  Identified  with  SCX-IMAC,  Multi-IMAC,  or  Multi-IMAC-HLB  Methods,  and 
Comparison  of  the  Efficiency  of  These  Different  Workflows 


workflow 

total 

phosphopeptides 

nonredundant 

phosphopeptides 

p-Ser 

P- 

Thr 

P- 

Tyr 

starting 

material 

(mg) 

sample 

preparation  time 
(day) 

LC -MS/MS 
time  (h)“ 

phosphopeptides  identified 
per  LC— MS/MS  hour 

SCX-IMAC 

7907 

SS19 

4658 

306 

25 

15 

7 

44 

125 

Multi- 

IMAC 

8309 

4727 

3350 

309 

26 

3 

3 

18 

263 

Multi- 

22785 

8969 

5655 

626 

55 

3 

4 

40 

224 

IMAC- 

HLB 


“Time  required  for  duplicate  LC— MS/MS  runs  for  each  sample. 


Figure  S.  Comparison  of  phosphopeptide  identifications  from  the  SCX-IMAC,  multistep  IMAC,  and  multi-IMAC-HLB  methods.  (A)  Numbers  of 
identified  phosphopeptides  in  each  method  with  phosphoRS  score  >99%  are  shown  in  the  bar  graph.  Phosphopeptides  (phosphoRS  score  >99%) 
with  1,  2,  3,  or  4  and  5  phosphate  groups  are  shown  in  purple,  blue,  yellow  and  red  colors.  Unique  phosphosite  numbers  confirmed  in  each  method 
are  calculated  from  phosphopeptides  with  phosphoRS  score  >99%  and  are  shown  with  the  dashed  line.  (B)  Numbers  of  phosphopeptides  with 
phospho-serine  (blue),  phospho-threonine  (green),  or  phospho-tyrosine  (red)  sites,  and  those  without  confident  localization  (phosphoRS  score 
<99%,  dark  purple)  are  shown  in  the  bar  graph.  Percentages  of  identified  phosphopeptides  from  the  total  identified  peptides  from  each  method  are 
shown  with  the  dashed  line.  (C)  Overlap  of  nonredundant  phosphopeptides  identified  in  each  method  are  shown  in  the  Venn  diagram. 


ACN,  generating  4  HLB  fractions  for  each  IMAC  enrichment 
cycle. 

The  multi-IMAC-HLB  method  generated  12  total  fractions, 
and  each  of  these  fractions  were  analyzed  by  LC— MS/MS  for  a 
100  min  run  with  73  min  gradient  of  3—30%  ACN,  the  same 
LC— MS  conditions  as  the  SCX-IMAC  method.  The  results 
showed  that  with  the  additional  HLB  fractionation,  the  total 
number  of  identified  phosphopeptides  increased  to  8969  as 
compared  to  the  4727  phosphopeptides  from  the  SCX-IMAC 
analysis  (Supporting  Information  Table  S5).  The  number  of 
phosphopeptides  with  confident  phosphosite  localization 
increased  from  3225  to  5328,  while  the  total  identified 
phosphosites  also  increased  from  3686  to  6337  (Supporting 
Information  Table  S5),  nearly  twice  as  much  as  without  HLB 
fractionation.  Most  of  the  IMAC-HLB  fractions  showed  a 


relatively  high  phosphopeptide  ratio,  around  70—80%,  with  a 
maximum  of  91.2%  (Supporting  Information  Table  S4,  Figure 
4A).  As  shown  in  Figure  3B,  most  multiply  phosphorylated 
peptides  were  detected  following  the  first  cycle  of  IMAC 
enrichment,  while  with  the  second  and  third  rounds  of  IMAC 
enrichments,  only  a  total  number  of  9  multiply  phosphorylated 
peptides  (with  2  phosphates)  were  identified.  However,  with 
HLB  fractionation,  the  proportion  of  multiply  phosphorylated 
peptides  from  IMAC-2  and  IMAC-3  enrichment  was  increased 
(Supporting  Information  Table  S4,  Figure  4B).  This  fact 
indicates  that  in  IMAC-2  and  IMAC-3  enrichment,  there  are 
still  multiply  phosphorylated  peptides  that  can  be  enriched,  and 
that  the  lack  of  detection  is  largely  limited  by  the  complexity  of 
the  sample.  Careful  further  fractionation  could  help  to  improve 
the  identification  of  these  multiply  phosphorylated  peptides. 
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Figure  6.  Comparison  of  phosphopeptide  characteristics  identified  by  the  SCX-IMAC  or  multi-IMAC-HLB  method.  (A)  Histogram  of  identified 
phosphopeptides  with  different  p/  value  ranges  by  the  SCX-IMAC  (blue)  or  multi-IMAC-HLB  (pink)  method  are  shown  in  the  bar  graph. 
Cumulative  percentage  of  phosphopeptides  from  low  to  high  p/  value  ranges  are  shown  with  the  dashed  lines.  (B)  Histogram  of  identified 
phosphopeptides  with  different  GRAVY  value  ranges  by  SCX-IMAC  (blue)  or  multi-IMAC-HLB  (pink)  method  are  shown  in  bar  graph.  Cumulative 
percentage  of  phosphopeptides  from  low  to  high  GRAVY  value  ranges  are  shown  with  the  dashed  lines.  (C,  D,  E)  Charge  states  of  phosphopeptides 
identified  in  SCX-IMAC  (C),  multistep  IMAC  (D),  and  multi-IMAC-HLB  (E)  methods  are  shown  in  the  pie  chart. 


The  percentage  of  modified  p-Ser,  p-Thr,  and  p-Tyr  did  not 
show  significant  differences  among  the  12  IMAC-HLB  fractions 
(Supporting  Information  Table  S4,  Figure  4A).  The  fractiona¬ 
tion  efficiency  of  HLB  is  shown  in  Figure  4C— E.  We  detected 
unique  phosphopeptides  in  every  HLB  sample,  indicating  good 
efficiency  of  HLB  fractionation. 

Multi-IMAC-HLB  Method  Provides  Higher  Numbers  of 
Identified  Phosphopeptides,  Using  Less  Starting  Material, 
than  the  SCX-IMAC  Method 

We  compared  the  number  of  phosphopeptides  identified  with 
SCX-IMAC  method,  the  multistep  IMAC  enrichment,  and  the 
multi-IMAC-HLB  method  (Table  l).  The  multistep  IMAC 
enrichment  and  multi-IMAC-HLB  method  require  3  mg  of 
starting  material,  much  less  than  the  15  mg  needed  for  SCX- 
IMAC  method.  However,  the  number  of  identified  phospho¬ 
peptides  is  greater  with  the  multi-IMAC-HLB  method,  with 
5519  phosphopeptides  identified  with  SCX-IMAC  compared  to 
8969  phosphopeptides  identified  with  the  multi-IMAC-HLB 
method  (Supporting  Information  Table  S5,  Figure  5A).  The 
increase  in  the  numbers  of  identified  phosphopeptides  is  not 


because  of  the  increase  in  the  overall  identified  peptides  (17060 
with  SCX-IMAC  method  and  12718  with  multi-IMAC-HLB 
method)  but  is  due  to  the  increase  in  the  ratio  of  identified 
phosphopeptides  out  of  all  detected  peptides.  The  phospho¬ 
peptide  ratio  increased  from  32.4%  with  SCX-IMAC  method  to 
70.5%  with  multi-IMAC-HLB  method  (Supporting  Informa¬ 
tion  Table  S5,  Figure  5B).  This  difference  can  be  attributed  to 
the  optimized  IMAC  conditions  in  multi-IMAC-HLB  method. 
The  ratio  of  p-Thr  and  p-Tyr  residues  was  also  increased  with 
the  multi-IMAC-HLB  method  as  well  as  the  multiply 
phosphorylated  peptide  percentage. 

The  overlap  of  these  3  methods  showed  that  most  of  the 
phosphopeptides  (64.2%)  identified  in  multistep  IMAC 
enrichment  were  also  identified  with  the  multi-IMAC-HLB 
method,  although  some  phosphopeptides  were  lost  during  the 
treatment  process  (Figure  5C).  The  SCX-IMAC  method  has 
only  38.6%  overlap  with  the  multistep  IMAC  and  multi-IMAC- 
HLB  methods,  indicating  that  SCX  and  HLB  fractionation 
might  have  different  characteristics  and  they  might  identify 
different  populations  of  phosphopeptides. 
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SCX-IMAC  and  Multi-IMAC-HLB  Methods  Identify  Different 
Populations  of  Phosphopeptides 

Because  SCX  and  HLB  separate  peptides  by  dissimilar  chemical 
principals  and  at  different  pH  values,  we  compared  the 
characteristics  of  phosphopeptides  identified  with  these  two 
distinct  fractionation  methods.  The  results  show  that  with  the 
SCX  fractionation  method,  most  of  the  identified  phosphopep¬ 
tides  are  acidic  phosphopeptides  with  low  pi  values.  The 
highest  portion  of  phosphopeptides,  1906  in  total,  has  pi  values 
ranging  from  3.5  to  4  (Figure  6A,  Table  2A).  For 


Table  2.  Comparison  of  pi  Value  and  GRAVY  Ranges  for 
Phosphopeptides  Identified  with  SCX-IMAC  or  Multi- 
IMAC-HLB  Method 


(A)  Distribution  of  phosphopeptides  of  different  p  1  value  ranges  identified  by 
SCX-IMAC  or  Multi-IMAC-HLB  method. 

SCX-IMAC  Multi-IMAC-HLB 


pi  values 

phosphopeptide 

(#) 

cumulative 

(%) 

phosphopeptide 

(#) 

cumulative 

(%) 

1.5 

0 

0.0 

0 

0.0 

2 

2 

0.0 

0 

0.0 
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29 

0.6 

32 

0.4 

3 

109 

2.5 

292 

3.6 

3.5 

840 

17.8 
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10.6 

4 

1906 

52.3 

1791 

30.5 

4.5 

1044 

71.2 

1787 

50.5 

5 

247 

75.7 

681 

58.1 

5.5 

645 

87.4 

1084 

70.2 

6 

289 

92.6 

859 

79.8 
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165 

95.6 

470 

85.0 

7 

23 

96.0 

210 

87.3 
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99.8 

916 

97.6 

8 

1 

99.8 

7 
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1 
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28 

97.9 

9 
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100.0 
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0 
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3 
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(B)  Distribution  of  phosphopeptides  of  different  GRAVY  ranges  identified  by 
SCX-IMAC  or  Multi-IMAC-HLB  method. 

SCX-IMAC 

Multi-IMAC-HLB 

GRAVY 

phosphopeptide 

(#) 

cumulative 

(%) 

phosphopeptide 

(#) 

cumulative 

(%) 

-3.5 

0 

0.0 

0 

0.0 

-3 

21 

0.4 

0 

0.0 

-2.5 

106 

2.3 

17 

0.2 

-2 

417 

9.9 

182 

2.2 

-1.5 

1070 

29.2 

997 

13.3 

-1 

ISOS 

56.5 

2526 

41.5 
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1434 

82.5 

3024 

75.2 

0 
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95.4 
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phosphopeptides  with  pi  values  lower  than  4,  2886 
phosphopeptides  were  identified  with  SCX-IMAC  method, 
while  2737  phosphopeptides  were  identified  with  the  multi- 
IMAC-HLB  method,  showing  no  difference  between  these  two 
methods.  However,  the  phosphopeptides  with  pi  values  higher 
than  4  increased  from  2633  (47.7%  of  total)  with  the  SCX- 


IMAC  method  to  6232  (69.5%  of  total)  with  the  multi-IMAC- 
HLB  method  (Table  2A).  For  the  basic  phosphopeptides  with 
pi  values  higher  than  7,  only  220  (4.0%  of  total) 
phosphopeptides  were  identified  with  SCX-IMAC  method, 
while  this  number  increased  to  1135  (12.7%  of  total)  with  the 
multi-IMAC-HLB  method,  a  more  than  5  times  increase  (Table 
2A).  The  multi-IMAC-HLB  method  even  identified  184 
phosphopeptides  with  pi  values  higher  than  8.5,  while  the 
SCX-IMAC  method  only  identified  9.  The  dramatic  increase  in 
identifications  for  phosphopeptides  with  higher  pi  values  likely 
results  from  the  decrease  of  high  pi  peptide  loss  when 
performing  the  high-pH  HLB  separation.  With  the  SCX 
method,  under  low  pH,  phosphopeptides  with  higher  pi  values 
are  likely  to  be  protonated,  thus  having  a  higher  affinity  to  SCX 
resin.  In  order  to  elute  these  phosphopeptides,  higher  salt 
concentrations  are  required.  However,  high  concentrations  of 
salt  make  it  difficult  to  solubilize  the  peptides.  As  a  result, 
phosphopeptides  with  high  pi  values  are  difficult  to  recover 
from  the  SCX  column  and  are  readily  lost  during  the  SCX 
process.  By  contrast,  with  the  high-pH  HLB  separation,  there 
are  no  such  problems,  and  these  phosphopeptides  can  be  more 
efficiently  recovered. 

The  GRAVY  values  also  showed  differences  between  the  two 
methods.  With  the  SCX-IMAC  method,  544  (9.9%  of  total) 
phosphopeptides  were  identified  with  GRAVY  values  lower 
than  —2,  while  with  the  multi-IMAC-HLB  method  only  199 
(2.2%  of  total)  were  identified  (Figure  6B,  Table  2B).  However, 
for  phosphopeptides  with  GRAVY  values  higher  than  —1.5,  the 
multi-IMAC-HLB  method  showed  a  dramatic  advantage  and 
increased  the  phosphopeptide  number  from  3905  (70.8%  of 
total)  to  7773  (86.7%  of  total)  (Table  2B).  The  reason  for 
these  differences  would  be  due  to  the  fact  that  SCX  fractionates 
phosphopeptides  mainly  by  their  charge  states,  and  the 
separation  is  performed  under  hydrophilic  conditions,  so  that 
less  hydrophilic  phosphopeptides  would  be  lost  in  the  flow 
through  during  the  separation.  By  comparison,  HLB  separates 
phosphopeptides  mainly  by  their  hydrophobicity  with  increas¬ 
ing  acetonitrile  percentage.  Phosphopeptides  with  higher 
hydrophobicity  are  separated  from  the  hydrophilic  ones;  thus, 
hydrophobic  phosphopeptides  have  a  greater  chance  to  be 
fractionated  and  identified. 

The  charge  states  for  the  identified  phosphopeptides  were 
also  substantially  different  between  the  SCX-IMAC  and  multi- 
IMAC-HLB  data  sets.  In  the  SCX-IMAC  method,  3700 
phosphopeptides  (67.0%  of  total)  exist  in  the  2+  charge  state, 
while  in  the  multi-IMAC-HLB  method  only  2895  phosphopep¬ 
tides  (34.4%  of  total)  were  found  at  2+  charge  state.  With  the 
SCX-IMAC  method,  1755  phosphopeptides  (31.8%  of  total) 
were  found  at  3+  charge  state,  and  this  number  increased  to 
4723  (52.7%  of  total)  with  the  multi-IMAC-HLB  method 
(Figure  6C-E,  Supporting  Information  Table  S5). 

All  the  above-mentioned  differences  indicate  that  multi- 
IMAC-HLB  and  SCX-IMAC  identify  different  populations  of 
phosphopeptides.  Because  most  of  the  fractionation  methods 
used  in  phosphoproteomic  studies  are  performed  under  very 
acidic  conditions  and  separate  phosphopeptides  by  hydrophilic 
interactions,  the  high-pH  HLB  fractionation  method  is 
complementary  to  other  fractionation  methods  such  as  SCX, 
HILIC,  or  ERLIC  to  increase  the  overall  phosphopeptide 
identification. 
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■  CONCLUSION 

In  this  study,  we  optimized  the  peptide-to-IMAC  beads  ratio 
for  IMAC  best  performance.  Three  rounds  of  optimized  IMAC 
enrichment  were  shown  to  be  sufficient  to  enrich  most  of  the 
phosphopeptides  from  whole  cell  mammalian  lysates.  Further 
fractionation  with  the  high-pH  HLB  cartridges  provided  higher 
numbers  of  phosphopeptides  and  phosphosites  and  also 
increased  the  identified  p-Thr  and  p-Tyr  ratios,  with 
significantly  less  starting  material  required  compared  to  SCX- 
IMAC  method.  The  multi-IMAC-HLB  technique  is  relatively 
quick  and  easy  to  perform.  Because  the  multi-IMAC-HLB 
method  separates  phosphopeptides  in  a  high-pH  range  by 
differences  in  hydrophobicity,  while  most  of  the  other 
fractionation  methods  perform  at  low-pH  conditions,  the 
multi-IMAC-HLB  method  shows  dramatic  improvement  for 
identification  of  phosphopeptides  with  higher  pi  and  GRAVY 
values.  The  charge  states  of  identified  phosphopeptides  with 
SCX-IMAC  or  multi-IMAC-HLB  method  are  also  significantly 
different.  SCX-IMAC  mostly  enriched  phosphopeptides  with  a 
2+  charge  state,  while  multi-IMAC-HLB  preferentially  enriched 
phosphopeptides  with  a  3+  charge  state.  In  conclusion,  multi- 
IMAC-HLB  method  is  a  relatively  easy  and  efficient  approach 
to  enrich  for  phosphopeptides,  especially  basic  and  hydro- 
phobic  phosphopeptides,  from  less  starting  material  than  is 
required  by  other  commonly  used  approaches. 
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